Constructing Zero-deficiency Parallel Prefix Circuits of Minimum Depth

نویسندگان

  • HAIKUN ZHU
  • CHUNG-KUAN CHENG
  • Haikun Zhu
چکیده

A parallel prefix circuit has n inputs x1, x2, . . . , xn, and computes the n outputs yi = xi • xi−1 • · · ·•x1, 1 ≤ i ≤ n, in parallel, where • is an arbitrary binary associative operator. Snir proved that the depth t and size s of any parallel prefix circuit satisfy the inequality t + s ≥ 2n − 2. Hence, a parallel prefix circuit is said to be of zero-deficiency if equality holds. In this paper, we provide a different proof for Snir’s theorem by capturing the structural information of zero-deficiency prefix circuits. Following our proof, we propose a new kind of zero-deficiency prefix circuit Z(d) by constructing a prefix circuit as wide as possible for a given depth d. It is proved that the Z(d) circuit has the minimal depth among all possible zero-deficiency prefix circuits.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Cyclic Segmented Parallel Prefix

The cyclic segmented parallel prefix (CSPP) circuit is a varation on parallel prefix. Whereas ordinary parallel prefix computes prefix sums of a vector from the beginning, CSPP allows the starting point to move arbitrarily, with the data “wrapping around.” The wraparound is widely useful. We have used CSPP to redesign many components of a superscalar processor to run in time logarithmic in the ...

متن کامل

Depth-Size Trade-Offs for Parallel Prefix Computation

A prefix circuit has n inputs xi, . , x, , and computes the n outputs xi 0 . . . 0 xi, i=l , . . , n, where 0 is an associative operation. It is shown that the depth t and the size s of parallel prefix circuits are related by the inequality t + s 2 2n 2. This is true even if arbitrary binary operations can be performed at each node. For 2lgn-2<t<n-loptimalcircuitswith t+s=2n-2arebuilt.Thedepth ...

متن کامل

A new approach to the design of optimal parallel prefix circuits

Parallel prefix is one of the fundamental algorithms in computer science. Parallel prefix networks are used to compute carries in fast addition circuits, and have a number of other applications, including the computation of linear recurrences and loop parallelization. A new construction, called Slices, for fan-out-constrained depth size optimal (DSO) parallel prefix circuits is presented. The c...

متن کامل

On the complexity of parallel prefix circuits

It is shown that complexity of implementation of prefix sums of m variables (i.e. functions x1 ◦ . . . ◦ xi, 1 ≤ i ≤ m) by circuits of depth dlog2 me in the case m = 2n is exactly 3.5 · 2 − (8.5 + 3.5(n mod 2))2bn/2c + n + 5. As a consequence, for an arbitrary m an upper bound (3.5 − o(1))m holds. In addition, an upper bound ( 3 3 11 − o(1) ) m for complexity of the minimal depth prefix circuit...

متن کامل

A novel logarithmic prefix adder with minimized power delay product

Parallel prefix addition is a technique for speeding up binary addition. Classical parallel prefix adder structures developed so far are optimized for logic depth, area, fan-out and interconnect count of logic circuits. Due to continuing integrating intensity and growing needs of portable devices, low-power and high-performance designs are of prime importance. A new technique, proposed for perf...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005